SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN THAT WE, Masaki Miura, a 
citizen of Japan residing at Kawasaki, Japan and 
Kenzo Kobayashi, a citizen of Japan residing at 
Kawasaki, Japan have invented certain new and 
useful improvements in 

VIDEO DISPLAY CONTROL METHOD, VIDEO DISPLAY 
CONTROL SYSTEM, AND APPARATUS EMPLOYED IN SUCH 
SYSTEM 

of which the following is a specification:

TITLE OF THE INVENTION

VIDEO DISPLAY CONTROL METHOD, VIDEO 
DISPLAY CONTROL SYSTEM, AND APPARATUS EMPLOYED IN 
SUCH SYSTEM 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to
video display control methods, video display control
systems, and apparatuses employed in such systems,
and more particularly to a video display control
method and a video display control system that
control a camera from a remote location via a
network and transmit video captured by the camera
via the network to display the captured video, and
an apparatus employed in such a system.

2. Description of the Related Art 
Recently, the communication network
environment has been shifting from circuit switching
to IP (Internet Protocol) networks, which requires
video communication to be performed via an IP
network as well. In a video monitoring system that
performs camera control from a remote location,
video transmission as well as camera control is
likewise performed via a relatively low-speed IP
network. In this case, in order to perform the
camera control accurately, it is necessary to quickly
present the user with video obtained as a result of
the camera control.

Compressed video still has a relatively
large amount of data. Therefore, in a low-speed
network, the amount of data to be transmitted is
large relative to the network bandwidth, and a delay
is prone to occur in the data transmission. As a
result, the response to the camera control is
delayed, resulting in poor camera operability.

FIG. 1 shows an operation sequence of a
conventional video display control method. In the
drawing, when a control apparatus 1 performs
operation control such as PAN, TILT, or ZOOM, the
command signal is transmitted to a controlled
apparatus 2 via a network. The propagation delay of
the command (e.g., 50 ms), the video encoding time
(e.g., 100 ms), and the delay in transmitting the
compressed video data (e.g., 50 ms) sum to
approximately 200 ms. Further, the video
transmission itself requires another 1,000 ms.
Therefore, it takes approximately 1,200 ms before
the control apparatus 1 obtains the video. This case
is based on a transmission rate of, for instance,
100 kbps; if the transmission rate is 25 kbps, the
delay is approximately 4 seconds.
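
The delay figures above can be checked with a short
calculation. The following Python sketch is an
illustration only; the function name and the frame
size of 100 kbit are assumptions, the latter inferred
from the statement that the transfer takes 1,000 ms
at 100 kbps.

```python
def conventional_delay_ms(frame_kbit, rate_kbps,
                          command_ms=50, encode_ms=100, first_data_ms=50):
    """Time from issuing a camera command until one full frame has arrived."""
    # Time needed to push one compressed frame through the line.
    transfer_ms = frame_kbit / rate_kbps * 1000.0
    return command_ms + encode_ms + first_data_ms + transfer_ms


# Assumed frame size: 1,000 ms of transfer at 100 kbps implies about 100 kbit.
FRAME_KBIT = 100

print(conventional_delay_ms(FRAME_KBIT, 100))  # 1200.0
print(conventional_delay_ms(FRAME_KBIT, 25))   # 4200.0
```

Under these assumptions the delay at 25 kbps comes to
about 4.2 seconds, which agrees with the figure of
approximately 4 seconds given above.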

Conventionally, response to a control 
command has been improved by reducing the amount of 
data to be transmitted by increasing a video 
compression rate or reducing a screen size. This, 
however, does not drastically improve the response, 
but has a disadvantage of causing deterioration in 
video quality. 

SUMMARY OF THE INVENTION 

It is a general object of the present 
invention to provide a video display control method, 
a video display control system, and apparatuses 
forming such a system in which the above-described 
disadvantage is eliminated. 

A more specific object of the present 
invention is to provide a video display control 
method and a video display control system that allow 
a user to be quickly presented with video obtained 
as a result of camera control and that can perform 
camera control without stress on a low-speed line, 
and apparatuses forming such a system. 

The above objects of the present invention
are achieved by a video display control method
including the steps of (a) generating camera
operation information concerning an operation of a
camera operated by remote control, (b) transmitting
data of video captured by the camera and the camera
operation information to a control apparatus via a
network, and (c) processing the data of the video
based on the camera operation information so that an
image of the video can be displayed at the control
apparatus.

The above objects of the present invention
are also achieved by a video display control system
including: a first apparatus including a camera and
generating camera operation information concerning
an operation of the camera, the camera being
operated by remote control; a second apparatus
processing data of video captured by the camera
based on the camera operation information so that an
image of the video can be displayed; and a network
connecting the first and second apparatuses, wherein
the data of the video and the camera operation
information are transmitted from the first apparatus
to the second apparatus via the network.

According to the above-described method
and system, the data of the video transmitted via
the network is processed based on the camera
operation information transmitted via the network.
Therefore, the image of the video can be displayed
to a user with a reduced transmission delay, and the
camera can be controlled without stress over a
low-speed line.

The above objects of the present invention
are also achieved by an apparatus for a system where
data of video and camera operation information are
transmitted from the apparatus to a network so that
an image of the video can be displayed, which
apparatus includes a camera capturing the video, a
camera control part controlling an operation of the
camera operated by remote control, a camera
operation determination part generating the camera
operation information from the operation of the
camera, and a camera operation information
transmission part transmitting the camera operation
information to the network.

According to the above-described apparatus,
the camera operation information can be generated
from any type of camera.

The above objects of the present invention
are further achieved by an apparatus for a system in
which data of video captured by a camera and camera
operation information concerning an operation of the
camera are transmitted via a network to the apparatus
so that an image of the video is displayed at the
apparatus, which apparatus includes a camera control
part generating a control signal for controlling the
camera from an operation of a user and transmitting
the control signal to the network, a video
processing part processing the data of the video
based on the camera operation information, and a
video display part displaying the image of the video
generated in the video processing part.

According to the above-described apparatus,
the image of the video can be displayed to the user
with a reduced transmission delay.

BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects, features and advantages of
the present invention will become more apparent from
the following detailed description when read in
conjunction with the accompanying drawings, in
which:

FIG. 1 is a diagram showing an operation
sequence of a conventional video display control
method;

FIG. 2 is a diagram showing a structure of
a first embodiment of a system to which a video
display control method of the present invention is
applied;

FIG. 3 is a diagram showing an operation
sequence of an embodiment of the video display
control method of the present invention;

FIG. 4 is a diagram for illustrating an
operation from a generation of camera operation
information to a generation of predicted video by
video processing according to the video display
control method of the present invention;

FIGS. 5A through 5C are diagrams for
illustrating camera operation determination
performed by a camera operation determination part
shown in FIG. 2;

FIG. 6 is a block diagram of an embodiment
of the camera operation determination part and a
video encoding part shown in FIG. 2;

FIG. 7 is a flowchart of an embodiment of
a PAN and TILT determination operation performed by
a motion vector computation part shown in FIG. 6;

FIG. 8 is a flowchart of an embodiment of
a ZOOM determination operation performed by the
motion vector computation part;

FIG. 9 is a flowchart of an embodiment of
the video processing performed by a video processing
part of a control apparatus shown in FIG. 2;

FIG. 10 is a diagram showing a structure
of a second embodiment of the system to which the
video display control method of the present
invention is applied;

FIGS. 11A through 11C are diagrams for
illustrating an interpolation operation for a
data-lacking part; and

FIGS. 12A through 12D are diagrams for
illustrating the interpolation operation in a case
of a low frame rate for transmission.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

A description will now be given, with
reference to the accompanying drawings, of
embodiments of the present invention.

FIG. 2 shows a structure of a first
embodiment of a system to which a video display
control method of the present invention is applied.
In the drawing, a control apparatus 10 by which a
user controls a camera and a controlled apparatus 20
including the controlled camera are connected by an
IP network 30 so as to form this system structure.

The control apparatus 10 includes a video
display part 11, a video processing part 12, a video
decoding part 13, a camera operation information
reception part 14, a camera control part 15, and a
network part 16. The controlled apparatus 20
includes a video input part 21, a camera operation
determination part 22, a video encoding part 23, a
camera operation information transmission part 24, a
camera part 25, a camera control part 26, and a
network part 27.

The user inputs control information such
as PAN, TILT, or ZOOM of the camera part 25 to the
camera control part 15 of the control apparatus 10.
The network part 16 of the control apparatus 10 and
the network part 27 of the controlled apparatus 20
are connected via the IP network 30 so that video
information and the control information are
transmitted. Thereby, the control information from
the camera control part 15 is supplied via the IP
network 30 to the camera control part 26 of the
controlled apparatus 20 so that the camera control
part 26 controls a PAN, TILT, or ZOOM operation of
the camera part 25 based on the supplied control
information. Video captured by the camera part 25
is taken into the video input part 21 as video data,
and is supplied to the camera operation
determination part 22. The camera operation
determination part 22 determines an operation of the
camera part 25 to be PAN, TILT, or ZOOM by
processing the video data. The video encoding part
23 performs MPEG (Moving Picture Experts Group)
encoding on the video data supplied from the video
input part 21. Any algorithm may be employed for
the above-described operation determination as long
as it is appropriate for the apparatus in use. The
operation determination may be performed based on
the frame rate of the video input part 21, that is,
for instance, every 1/30 second, or based on the
frame rate of the video encoding part 23, that is,
for instance, every few seconds. Finer control is
possible if the operation determination is performed
at a higher frame rate.

The camera operation information
transmission part 24 of the controlled apparatus 20
supplies the camera operation information received
from the camera operation determination part 22 to
the network part 27 so as to transmit the camera
operation information to the control apparatus 10.
The video information output from the video encoding
part 23 and the camera operation information output
from the camera operation information transmission
part 24 may or may not be multiplexed before
being transmitted from the network part 27. For
instance, a system employing the ITU-T
recommendation H.323 requires such information to be
multiplexed, but a system employing IP does not
particularly require such information to be
multiplexed since IP itself has a function of
constructing a plurality of logical paths.

The camera operation information and the
video information may be transmitted via separate
networks. In this embodiment, the IP network 30 is
employed as the network, but another network may be
employed to transmit the above-described information.
The video information output from the video encoding
part 23 and the camera operation information output
from the camera operation information transmission
part 24 may be synchronized with or independent of
each other in terms of time. A typical
synchronization method transmits the camera
operation information right before the video
information.

The camera operation information reception
part 14 of the control apparatus 10 outputs the
camera operation information received by the network
part 16 to the video processing part 12. The video
decoding part 13 MPEG-decodes the video information
received by the network part 16 and supplies the
MPEG-decoded video information to the video
processing part 12.

The video processing part 12, based on the
camera operation information, processes the video
data already output from the video decoding part 13
to perform operations corresponding to the PAN, TILT,
or ZOOM of the camera part 25, and displays the video
on the video display part 11. The video processing
in the video processing part 12 may be based on or
independent of the reception timing of the camera
operation information.

According to the above-described structure,
even in video monitoring by way of a low-speed
network, the camera operation information describing
the motion of the whole screen is received before the
entire video is received. Therefore, by the video
processing, the video based on camera control can be
generated and displayed with a reduced delay in
responding to a camera control operation by the user.
Further, since the camera operation information is
generated by the video processing without any
interface to the camera for obtaining the camera
operation information, the camera operation
information can be generated for any kind of camera.

FIG. 3 shows an operation sequence of an
embodiment of the video display control method of
the present invention. In the drawing, when the
control apparatus 10 performs operation control such
as PAN, TILT, or ZOOM, the control command is
transmitted via the network 30 to the controlled
apparatus 20 at a time t0. The information of one
video frame and the camera operation information are
transmitted to the control apparatus 10 at a time t1
after the passage of approximately 200 ms, which is
the sum of the propagation delay of the command
(e.g., 50 ms), the video encoding time (e.g.,
100 ms), and the delay in transmitting the
compressed video data (e.g., 50 ms).

Since, after the time t1, predicted video
can be generated from preceding video frame data and
the received camera operation information, video
based on a pseudo camera operation can be displayed
after the time t1. According to the conventional
method, video cannot be displayed until the
information of the one video frame is collected at a
time t2. On the other hand, in this embodiment, the
time from a camera control operation by the user
until a screen update can be reduced by 85% compared
with the conventional method.
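
As an illustration only, the times t0, t1, and t2 can
be laid out with the example figures used for FIG. 1.
The following Python sketch assumes the 100 kbps case
with a full-frame transfer time of 1,000 ms; the
variable names are hypothetical.

```python
# Example delays taken from the description (all in milliseconds).
T0_MS = 0
COMMAND_MS, ENCODE_MS, FIRST_DATA_MS = 50, 100, 50
FRAME_TRANSFER_MS = 1000  # full-frame transfer at the 100 kbps example rate

# t1: camera operation information is available at the control apparatus.
t1 = T0_MS + COMMAND_MS + ENCODE_MS + FIRST_DATA_MS
# t2: one complete video frame has been collected (conventional display time).
t2 = t1 + FRAME_TRANSFER_MS

# Fraction of the conventional waiting time that is saved.
reduction = (t2 - t1) / (t2 - T0_MS)

print(t1, t2, round(reduction * 100, 1))  # 200 1200 83.3
```

With these example values the saving is roughly 83%,
which is of the same order as the reduction stated
above; the exact figure depends on the frame size and
the transmission rate.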

Next, a description will be given of an
operation from the generation of camera operation
information to the generation of predicted video by
the video processing according to the video display
control method of the present invention. FIG. 4
shows frames to be transmitted 50a through 55a in
the controlled apparatus 20, transmitted frames 50b
through 55b in the control apparatus 10, and
displayed video frames 50c through 55c subjected to
the video processing in the control apparatus 10.

The transmitted frames 50b through 55b
fall considerably behind the frames to be
transmitted 50a through 55a due to transmission
delay. If a PAN operation is started from a point
of the frame 53a by the user's operation of the
control apparatus 10, the number of pixels of a
displacement of the camera part 25 by the PAN
operation (a displacement pixel number Δr1) can be
obtained by detecting the motion vectors of the
frame 53a from the frame 52a. The displacement
pixel number Δr1 is transmitted without waiting for
the transmission of the video.

Due to the transmission delay, if a
transmitted frame is merely displayed, the video of
the frame 51b is displayed in the control apparatus
10 when the transmission of the frame 53a is started.
However, by transmitting the displacement pixel
number Δr1 to the control apparatus 10
independently or right before the transmission of
the video so that the video processing can be
performed based on the displacement pixel number
Δr1 in the control apparatus 10, video corrected by
the displacement pixel number Δr1, such as the frame
51c, can be displayed. It can be seen from a
comparison among the frames 53a, 51b, and 51c that
the frame 51c reflects the position information of
the frame 53a. However, since the corresponding
video information has not yet been received, the
video of the frame 51c lacks a part thereof, as
indicated by crosshatching in FIG. 4.

Thereafter, similarly, by transmitting in
advance the number of pixels of a displacement of
the camera part 25 by a PAN operation (a
displacement pixel number Δrn) so that the video
processing can be performed based on the
accumulation and correction of the displacement
pixel number Δrn in the control apparatus 10, a
video display eliminating the effect of the
transmission delay can be achieved. This allows the
user to perform almost real-time camera control
without the effect of the transmission delay.
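
The correction of a displayed frame by accumulated
displacement pixel numbers can be sketched as
follows. This Python fragment is a minimal
illustration, not the disclosed implementation: the
function name, the use of None to mark the
data-lacking part, and the sign convention (a
positive displacement moves the scene content to the
left on the screen) are all assumptions.

```python
LACKING = None  # marks pixels with no video data yet (crosshatched in FIG. 4)


def shift_frame(frame, dx):
    """Return a copy of frame (a list of pixel rows) displaced by dx pixels.

    Assumed convention: dx > 0 moves the content left, so the rightmost
    columns become data-lacking; dx < 0 moves the content right.
    """
    width = len(frame[0])
    shifted = []
    for row in frame:
        if dx >= 0:
            new_row = row[dx:] + [LACKING] * min(dx, width)
        else:
            new_row = [LACKING] * min(-dx, width) + row[:dx]
        shifted.append(new_row)
    return shifted


# Last fully received frame (made-up pixel values standing in for frame 51b).
frame_51b = [[10, 11, 12, 13],
             [20, 21, 22, 23]]

# Displacement pixel numbers received ahead of the video (made-up values).
displacements = [1, 1]

# The displayed frame is offset by the accumulated displacement.
frame_51c = shift_frame(frame_51b, sum(displacements))
print(frame_51c)  # [[12, 13, None, None], [22, 23, None, None]]
```

The None entries correspond to the part of the frame
for which no video data has been received yet.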

Next, a description will be given, with
reference to FIGS. 5A through 5C, of a camera
operation determination performed by the camera
operation determination part 22. An embodiment
shown herein focuses on motion vectors of video.
Generally, when the camera part 25 performs a PAN,
TILT, or ZOOM operation, a motion vector is
generated in each of macro blocks forming a screen
as indicated by each arrow in each of FIGS. 5A
through 5C. By detecting and processing these
motion vectors, the motion of the camera part 25 can
be determined. FIGS. 5A through 5C show a PAN
operation case, a TILT operation case, and a ZOOM
operation case, respectively.

In the case of focusing on the motion
vectors, a motion detection part employed in a
commonly used DCT (discrete cosine transform) video
encoder can be shared with the camera operation
determination part 22.

FIG. 6 is a block diagram of an embodiment
of the camera operation determination part 22 and
the video encoding part 23. In the drawing, video
data supplied from the video input part 21 is
DCT-encoded, for instance, in blocks of 8 pixels x 8
lines in a DCT encoder 31. Each obtained DCT
coefficient is quantized based on its target bit and
visual property in a quantization part 32 so that
information compression is performed in terms of
space. Then, each quantized DCT coefficient is
supplied via a coefficient prediction part 34 to a
variable length encoder 36. In the variable length
encoder 36, macro block encoding information such as
motion vectors and encoding modes, and the quantized
DCT coefficients are subjected to variable length
encoding that allocates a shorter code to data
having a higher appearance frequency. Obtained
variable length data is supplied to the network part
27.

Further, the quantized information is
inversely quantized in an inverse quantization part
38 and is DCT-decoded in a DCT decoder 40 to be
stored in a frame memory 41 as a reference screen.
A motion detection part 42 supplies motion vectors
obtained by detecting the motion of the screen to a
motion vector prediction part 43, a motion
compensation part 44, and a motion vector
computation part 47. The reference screen read out
from the frame memory 41 is supplied to the motion
compensation part 44, which obtains macro block
video data from the reference screen by motion
prediction. The obtained macro block video data is
supplied to a subtracter 45 and an adder 46. The
subtracter 45 calculates a difference between the
obtained macro block video data and the input macro
block video data so as to obtain a prediction error
signal. The prediction error signal is supplied via
the DCT encoder 31 and the quantization part 32 to
the variable length encoder 36.

The motion vector computation part 47
generates camera operation information by
determining a camera operation based on the motion
vectors detected in the motion detection part 42.
This camera operation information is supplied to the
camera operation information transmission part 24
and is output therefrom to the IP network 30 via the
network part 27.

FIG. 7 is a flowchart of an embodiment of
a PAN and TILT determination operation performed by
the motion vector computation part 47. In the
drawing, in step S10, a counter is reset to perform
determination based on operation continuation. Next,
in step S12, a motion vector of each macro block is
obtained from the motion detection part 42, and in
step S14, it is determined whether the motion vector
of each macro block is generated in an x direction
(in a lateral direction in the screen), that is, it
is determined whether a camera operation is PAN.

If the above-described condition is
satisfied, it is determined in step S16 whether the
motion vector of each macro block is generated in a
y direction (in a vertical direction in the screen),
that is, it is determined whether the camera
operation is TILT. If this condition is satisfied,
in order to eliminate the effect of vibrations, it
is determined in step S18 whether each macro block
has a motion equal to or greater than a certain
amount. If each macro block has a motion equal to
or greater than the certain amount, the operation
proceeds to step S20. If one of steps S14 through
S16 is not satisfied, the operation returns to step
S10 and repeats the above-described steps.

In step S20, the counter is advanced only
by one, and in step S22, it is determined whether a
counter value exceeds a given value T5. If the
counter value exceeds the given value T5 and the
motion lasts for a given period of time, in step S24,
it is determined that the camera operation is PAN or
TILT, and information collected into one motion
vector is transmitted with time information employed
in the control apparatus 10 being added to the
collected information.
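
One possible reading of the flow of FIG. 7 is
sketched below in Python. It is an illustration under
stated assumptions, not the disclosed implementation:
the function names, the thresholds min_motion and
tolerance, the use of a frame index as the time
information, and the averaging used to collect the
vectors into one motion vector are all hypothetical.

```python
def classify(vectors, min_motion=2.0, tolerance=0.5):
    """Classify one frame's macro-block motion vectors as 'PAN', 'TILT' or None."""
    if not vectors:
        return None
    # S14 and S18: every block moves laterally by at least min_motion.
    if all(abs(vx) >= min_motion and abs(vy) <= tolerance for vx, vy in vectors):
        return "PAN"
    # S16 and S18: every block moves vertically by at least min_motion.
    if all(abs(vy) >= min_motion and abs(vx) <= tolerance for vx, vy in vectors):
        return "TILT"
    return None


def detect_pan_tilt(frames, threshold=3):
    """Return (frame_index, operation, mean_vector) for every frame at which
    the same operation has persisted for more than threshold frames."""
    results = []
    counter, current = 0, None
    for index, vectors in enumerate(frames):
        operation = classify(vectors)
        if operation is None or operation != current:
            counter, current = 0, operation   # S10: reset the counter
        if operation is None:
            continue
        counter += 1                          # S20: advance the counter by one
        if counter > threshold:               # S22: compare with the given value
            n = len(vectors)
            mean = (sum(vx for vx, _ in vectors) / n,
                    sum(vy for _, vy in vectors) / n)
            results.append((index, operation, mean))  # S24: report with time info
    return results


pan = [(3, 0)] * 4        # four macro blocks all moving 3 pixels laterally
jitter = [(0.2, 0.1)] * 4  # small vibration, below min_motion
print(detect_pan_tilt([pan, pan, jitter, pan, pan, pan, pan]))
# [(6, 'PAN', (3.0, 0.0))]
```

In this example the small vibration in the third
frame resets the counter, so the PAN operation is
reported only after four consecutive qualifying
frames.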

FIG. 8 is a flowchart of an embodiment of
a ZOOM determination operation performed by the
motion vector computation part 47. In the drawing,
in step S30, the counter is reset to perform
detection based on operation continuation. Next, in
step S32, a motion vector of each macro block is
obtained from the motion detection part 42, and in
step S34, it is determined whether or not the motion
vectors radially spread from or converge to the
center of the video by determining whether a
direction of a radiation in each macro block and the
motion vector of each macro block fall within a
certain angle range.

If this condition is satisfied, in step
S36, in order to eliminate the effect of vibrations,
it is determined whether each macro block has a
motion equal to or greater than a certain amount.
If each macro block has a motion equal to or greater
than the certain amount, the operation proceeds to
step S40. If one of steps S34 and S36 is not
satisfied, the operation returns to step S30 and
repeats the above-described steps.

In step S40, the counter is advanced only
by one, and in step S42, it is determined whether a
counter value exceeds the given value T5. If the
counter value exceeds the given value T5 and the
motion lasts for a given period of time, in step S44,
it is determined that a camera operation is ZOOM,
and information collected into one zoom rate is
transmitted with time information employed in the
control apparatus 10 being added to the collected
information.
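
The angle test of step S34 and the magnitude test of
step S36 can be sketched for a single frame as
follows. This Python fragment is an illustration
only; the function name, the thresholds, and the
return labels are assumptions, the magnitude test is
applied before the angle test merely to avoid a
division by zero, and the counter of steps S40 and
S42 is omitted.

```python
import math


def radial_pattern(blocks, center, max_angle_deg=20.0, min_motion=1.0):
    """blocks: list of ((x, y), (vx, vy)) giving each macro block's position
    and motion vector. Returns 'SPREAD', 'CONVERGE' or None."""
    signs = []
    for (x, y), (vx, vy) in blocks:
        rx, ry = x - center[0], y - center[1]
        r, v = math.hypot(rx, ry), math.hypot(vx, vy)
        if r == 0:
            continue  # a block at the exact center has no radial direction
        if v < min_motion:
            return None  # S36: motion too small, possibly vibration
        # S34: angle between the radial direction and the motion vector.
        cos_a = max(-1.0, min(1.0, (rx * vx + ry * vy) / (r * v)))
        angle = math.degrees(math.acos(cos_a))
        if angle <= max_angle_deg:
            signs.append(1)    # pointing away from the center
        elif angle >= 180.0 - max_angle_deg:
            signs.append(-1)   # pointing toward the center
        else:
            return None
    if signs and all(s > 0 for s in signs):
        return "SPREAD"
    if signs and all(s < 0 for s in signs):
        return "CONVERGE"
    return None


corners = [(0, 0), (10, 0), (0, 10), (10, 10)]
outward = [(-2, -2), (2, -2), (-2, 2), (2, 2)]
print(radial_pattern(list(zip(corners, outward)), center=(5, 5)))  # SPREAD
```

Vectors that all point in one lateral direction, as
in a PAN operation, fail the angle test and yield
None.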

FIG. 9 is a flowchart of an embodiment of
the video processing performed by the video
processing part 12 of the control apparatus 10. In
the drawing, in step S50, a cumulative value S that
is an accumulation of every received motion vector,
and a cumulative value D that is an accumulation of
every motion vector received by the time right
before a frame update to a presently displayed frame
are cleared to be zero.

Next, in step S52, a motion vector of a
vector generation time T1 (a motion vector V(T1)) is
received, and in step S54, the motion vector V(T1)
is accumulated on the cumulative value S. In step
S56, it is determined whether a frame is updated
since the reception of a preceding motion vector.
If the frame is updated, the operation proceeds to
step S58 to set a cumulative value of the motion
vectors received by the time right before the frame
update in the cumulative value D.

Then, the operation proceeds to step S60
to offset the video based on a difference (S-D)
between the cumulative values S and D, and display
the offset video. Thereafter, the operation returns
to step S52 and repeats the above-described steps.
Thereby, an appropriate amount of offset can be
obtained constantly.
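
The bookkeeping of the cumulative values S and D can
be sketched as follows. This Python fragment is a
minimal illustration, not the disclosed
implementation; the class name and method names are
hypothetical, and it assumes that a frame update
reflects exactly the motion vectors received before
it.

```python
class OffsetTracker:
    """Tracks S (all motion vectors received) and D (those received up to
    the last frame update) and reports the display offset S - D."""

    def __init__(self):
        self.s = (0, 0)             # S50: cumulative value S cleared to zero
        self.d = (0, 0)             # S50: cumulative value D cleared to zero
        self.frame_updated = False

    def on_frame_update(self):
        """Call when a newly received frame replaces the displayed frame."""
        self.frame_updated = True

    def on_motion_vector(self, v):
        """S52 to S60: take one motion vector and return the offset S - D."""
        if self.frame_updated:
            # S56/S58: D becomes the sum of the vectors received right
            # before the frame update, i.e. S before adding this vector.
            self.d = self.s
            self.frame_updated = False
        self.s = (self.s[0] + v[0], self.s[1] + v[1])          # S54
        return (self.s[0] - self.d[0], self.s[1] - self.d[1])  # S60


tracker = OffsetTracker()
print(tracker.on_motion_vector((3, 0)))  # (3, 0)
print(tracker.on_motion_vector((3, 0)))  # (6, 0)
tracker.on_frame_update()
print(tracker.on_motion_vector((2, 0)))  # (2, 0)
```

After the frame update, only the motion not yet
reflected in the displayed frame remains in the
offset.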

FIG. 10 shows a structure of a second
embodiment of the system to which the video display
control method of the present invention is applied.
In the drawing, the same elements as those of FIG. 2
are referred to by the same numerals, and a
description thereof will be omitted. In FIG. 10, a
camera control part 66 of the controlled apparatus
20 recognizes control information (camera control
command) supplied from the control apparatus 10 so
as to control a PAN, TILT, or ZOOM operation of the
camera part 25 based on the supplied control
information and activate a camera operation
determination part 62 only while the camera control
is performed. Thereby, the camera operation
determination part 62 processes video data to
determine a camera operation to be PAN, TILT, or
ZOOM. Since this determination operation is
performed only while the camera operation
determination part 62 is activated by the camera
control part 66, a wrong recognition of the camera
operation can be prevented, thus increasing accuracy
in the camera control.

A variety of methods can be employed as
the video processing method of the camera operation
determination part 22 or 62. For instance, when the
camera part 25 is not in motion, motion detection
can be performed with higher accuracy by generating
background subtraction video at the camera position,
detecting a video region where no change occurs, and
performing camera operation determination by
employing data of the region.

Further, since a video motion in the
control apparatus 10 precedes live video, the frames
51c through 55c each have a part lacking video data
(a crosshatched part) as shown in FIG. 4. These
parts can be interpolated.

For instance, a storage part for a virtual
large screen is provided in the control apparatus 10.
In sequentially displaying on the video display part
11 a series of video frames shown in FIG. 11B of a
scene shown in FIG. 11A, each video frame is
compared with video already stored in the storage
part, and a presently displayed video frame is
written to the storage part in the best fitting
position. At this time, an operation such as
scaling or lens distortion correction is performed
if necessary. The position where the presently
displayed video frame is to be written can be
roughly determined by employing motion vector
information. By repeating this writing operation, a
virtual large screen as shown in FIG. 11C can be
generated to be stored in the storage part for a
virtual large screen.
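
The repeated writing operation can be sketched as
follows. This Python fragment is an illustration
only: the function names are hypothetical, the
writing position is taken directly from an assumed
cumulative motion vector, and the comparison for the
best fitting position, scaling, and lens distortion
correction are omitted.

```python
def make_canvas(width, height):
    """Create an empty virtual large screen; None means no data stored yet."""
    return [[None] * width for _ in range(height)]


def write_frame(canvas, frame, left, top):
    """Write frame into canvas with its top-left corner at (left, top).
    Pixels falling outside the canvas are ignored."""
    for r, row in enumerate(frame):
        for c, pixel in enumerate(row):
            y, x = top + r, left + c
            if 0 <= y < len(canvas) and 0 <= x < len(canvas[0]):
                canvas[y][x] = pixel


canvas = make_canvas(6, 2)
# First frame at the origin, second frame after a PAN of 2 pixels
# (position assumed to come from the accumulated motion vectors).
write_frame(canvas, [[1, 2, 3], [4, 5, 6]], left=0, top=0)
write_frame(canvas, [[3, 7, 8], [6, 9, 0]], left=2, top=0)
print(canvas)  # [[1, 2, 3, 7, 8, None], [4, 5, 6, 9, 0, None]]
```

Each newly written frame extends the stored scene, so
that the canvas gradually covers an area larger than
a single frame.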

Further, in the case where the camera part
25 is not in motion, video from which a moving
object is removed can be generated by performing the
above-described writing to the storage part for a
virtual large screen and adding a background
subtraction operation. Brightness correction is
performed on the presently displayed video based on
surrounding video information so as to prevent
variations in brightness.

Thereafter, in the case of the preceding
video motion in the control apparatus 10, video data
necessary for a part of the video lacking video data
(a data-lacking part of the video) is extracted from
the storage part for a virtual large screen to be
employed for interpolation. Thereby, in the
preceding video motion, the data-lacking part of the
video can be naturally interpolated. The
interpolation may be performed by filling the
data-lacking part of the video with similar colors
based on the analysis of a boundary between the
data-lacking part and the remaining part of the
video. The interpolation may also be performed by
filling the data-lacking part with history video or
boundary colors.
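
The two interpolation options can be sketched
together as follows. This Python fragment is an
illustration only; the function name is hypothetical,
data-lacking pixels are assumed to be marked with
None, stored is assumed to be the region of the
virtual large screen already aligned with the frame,
and the boundary colour is simply the nearest valid
pixel to the left in the same row.

```python
def interpolate(frame, stored=None):
    """Fill data-lacking pixels (None) in frame.

    A stored pixel at the same position is used when available (history
    video); otherwise the most recent valid pixel in the row is repeated
    (boundary colour).
    """
    result = []
    for r, row in enumerate(frame):
        valid = [p for p in row if p is not None]
        last = valid[0] if valid else None  # fallback for leading gaps
        new_row = []
        for c, pixel in enumerate(row):
            if pixel is None and stored is not None and stored[r][c] is not None:
                pixel = stored[r][c]   # taken from the virtual large screen
            elif pixel is None:
                pixel = last           # boundary colour
            else:
                last = pixel
            new_row.append(pixel)
        result.append(new_row)
    return result


frame = [[1, 2, None, None]]
stored = [[9, 9, 7, None]]
print(interpolate(frame, stored))  # [[1, 2, 7, 2]]
print(interpolate(frame))          # [[1, 2, 2, 2]]
```

Pixels already present in the frame are never
overwritten; only the data-lacking part is filled.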

In the case of a low frame rate for
transmission, according to the conventional method,
the video of a scene shown in FIG. 12A captured by a
PAN operation lacks its continuity as shown in FIG.
12B. On the other hand, according to the present
invention in which the video motion in the control
apparatus 10 precedes live video, frame
interpolation is performed to display video frames
at a rate higher than the frame rate as shown in FIG.
12C. Here, a time interval between successive frame
transmissions (an interframe time interval) is
divided into given time periods Δt. A motion
vector Δv per time period Δt is obtained from
motion vectors V obtained by the time right before
the last frame update. The video is displayed by
being shifted by the number of pixels of the motion
vector Δv at every time period Δt. Thereby, the
continuity of the video can be maintained.
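
The division of the interframe time interval can be
sketched as follows. This Python fragment is an
illustration only; the function name is hypothetical,
and it assumes that V is the motion accumulated over
the last interframe interval and that the camera
keeps moving at the same rate during the next one.

```python
def interpolation_offsets(v, interframe_ms, dt_ms):
    """Return the cumulative pixel offsets to apply at each time period dt.

    v is the motion (vx, vy) in pixels over one interframe interval.
    """
    steps = max(1, int(interframe_ms // dt_ms))
    dvx, dvy = v[0] / steps, v[1] / steps  # motion vector per period dt
    return [(round(dvx * k), round(dvy * k)) for k in range(1, steps + 1)]


# 40 pixels of PAN per 1,000 ms frame interval, redisplayed every 250 ms.
print(interpolation_offsets((40, 0), 1000, 250))
# [(10, 0), (20, 0), (30, 0), (40, 0)]
```

In this example the displayed video is shifted by 10
pixels every 250 ms instead of jumping by 40 pixels
once per second.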

Also, in this case, the interpolated
frames have data-lacking parts as shown in FIG. 12C.
However, by employing the above-described storage
part for a virtual large screen, the data-lacking
parts of the interpolated frames can be naturally
interpolated as shown in FIG. 12D.

Thus, according to the present invention,
even in the case of a video transmission via a
relatively low-speed network, by processing video in
the control apparatus 10 based on camera operation
information of the controlled apparatus 20, video
having a reduced transmission delay can be presented
to the user on the side of the control apparatus 10,
and a video monitoring system that can perform
camera control without stress can be constructed
even with a low transmission rate line.

The present invention is not limited to
the specifically disclosed embodiments, but
variations and modifications may be made without
departing from the scope of the present invention.

The present application is based on
Japanese priority application No. 2000-350220 filed
on November 16, 2000, the entire contents of which
are hereby incorporated by reference.



